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Abstract. We have improved an algorithm generating synchronizing 
automata with a large length of the shortest reset words. This has been 
done by refining some known results concerning bounds on the reset 
length. Our improvements make possible to consider a number of conjec- 
tures and open questions concerning synchronizing automata, checking 
them for automata with a small number of states and discussing the re- 
sults. In particular, we have verified the Cerny conjecture for all binary 
automata with at most 12 states, and all ternary automata with at most 
8 states. 


1 Introduction 


A deterministic finite automaton A is (Q, X, ô}, where Q is the set of the states, 
X is the input alphabet, and 6: Q x X > Q is the (complete) transition function. 
Throughout the paper, by n we denote the number of states |Q]. If |X| = k then 
A is called k-ary. The transition function 6 is naturally extended to a function 
22 x ©* — 2%. The image of S C Q under the action of a word w € X* is 
Sw = {0(q,w) | q E€ S}. The rank of a word w € X* is |Qw], and the rank of A 
is the minimal rank of a word over A. For a non-empty subset ©” C X, we may 
define the automaton A’ = (Q, =”, 6’), where 0’ is the natural restriction of ô to 
X”. In such a case A is called an extension of A’. The automata of rank 1 are 
called synchronizing, and each word w with |Qw| = 1 is called a synchronizing 
(or reset) word for A. An automaton is irreducibly synchronizing if it is not an 
extension of a synchronizing automaton over a smaller alphabet. 

We are interested in the length of a shortest reset word for A (there may 
be more than one word of the same shortest length). We call it the reset length 
of A. The famous Černý conjecture states that every synchronizing automaton 
A with n states has a reset word of length < (n — 1)? [9]. This conjecture 
was formulated by Černý in 1969 and is considered the longest-standing open 
problem in combinatorial theory of finite automata. So far, the conjecture has 
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been proved only for a few special classes of automata, and a cubic upper bound 
(n — n)/6 — 1 [19] has been established, which was not improved for over 30 
years (see |15, 24] for excellent surveys). The bound (n — 1)? is met for every n 
by the Cerny automata [9], which is the only known infinite series of automata 
meeting this bound (besides that, there are 8 known particular examples with 
n < 6 states [22] also meeting the bound). 

There were several efforts to check computationally the conjecture for all 
automata with a small number of states. In particular, Ananichev, Gusev, and 
Volkov [3,4] have checked all binary automata with at most n = 9 states, and 
the checking for all automata with at most n = 10 states was reported in [22]. 
In [16], using a dedicated algorithm, we have verified the conjecture for all binary 
automata with n < 11 states. 

In this paper, first we describe improvements to our algorithm from [16], 
which are aimed at making possible verifying the conjectures for larger au- 
tomata. While these are results of a rather technical nature, and may be not 
very interesting from theoretical point of view, they make possible to restrict 
the computation process to much smaller class of relevant automata, and thus 
to consider also automata with a larger number of states. 

We extend verification of the Cerny conjecture up to 12 states and present 
an extensive experimental study on important problems and conjectures closely 
related to upper bounds on reset lengths. We consider known conjectures, and 
restate or state new ones basing on our experiments. Most of them imply an 
improvement for the general cubic bound, and hence are very hard but stand as 
possible ways to attack the main problem. All of the conjectures are experimen- 
tally confirmed for automata with a small number of states and/or letters. 


2 Reset Lengths of Extensions 


In this section we describe two theoretical results we apply in the improved algo- 
rithm. We are interested mainly in estimating the reset length of synchronizing 
automata that arise as extensions of non-synchronizing automata by one letter. 
In some cases, we are able to provide better upper bounds than the general 
bound (n? — n)/6 — 1 [19]. 

In particular, we search for synchronizing automata with relatively large 
reset length. We improve the algorithm from [16] which takes a set of (k — 1)- 
ary automata with n states and generates all their nonisomorphic one-letter 
extensions. To perform an exhaustive search over the k-ary automata with n 
states with some property, we need to progressively run the algorithm /—1 times 
starting from the complete set of non-isomorphic unary automata. However, in 
each run, if we know that any extension of an automaton A cannot have the 
desired property, we can safely drop A from further computations. Since the 
number of generated automata grows rapidly, suitable knowledge saves a lot of 
computational time and extends the class of the automata investigated. The 
technical details of the algorithm and proofs can be found in [17]. 
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A subset M C Q of the states is called compressible, if there is a word w 
such that |Mw| < |M|. Let A = (Q, 2,6) be a finite automaton. We say that 
a sequence (M;i, xi, yi), (1 < i < £) of m-subsets (subsets of size m) M; of Q 
and pairs of states x;, y; E Q is an m-subset Frankl-Pin sequence if the following 
conditions are satisfied 


1. zi yi E M; for 1 < i< &; 
2. either x; or y; is not in M; for alll <j < i< £. 


If all the pairs {x;, yi} belong to a set P of pairs, we will say that this sequence 
is over P. Given a set P of compressible pairs, by the synchronizing height h(P) 
of P we mean the minimal h such that for each pair {x,y} € P there exists a 
word w of length h such that cw = yw. 

It is known that a shortest word compressing M cannot be longer than the 
length of the Frankl-Pin sequence starting from M [12] (this, in fact, is used to 
obtain the bound (n? —n)/6 mentioned above). Our first technical improvement 
is that if the synchronizing height is smaller than the maximal length of a Frankl- 
Pin sequence over P, then we have 


Theorem 1. Let P be a set of compressible pairs in A, h(P) the synchronizing 
height of P, and p(P) the maximal length of a Frankl-Pin sequence over P. 
Then, for every compressible m-subset M of Q (2 < m < n), there is a word 
compressing M whose length does not exceed 


(PF?) -P P) 

This result improves the estimation in [12] by the negative summand (p(P) — 
h(P)). It is to be combined and compared with the result by J.-E. Pin [18] saying 
that if w is a word of rank r and there exists a word of rank < r—1, then there is 
such a word of length < 2|w|-+-n—r+1. There are other results of this kind that 
can be used for providing bounds for extensions, as that in [6]. Unfortunately, 
for small values of n that are within our considerations, this does not overcome 
the bound from Theorem 1. 

Recall that an automaton A = (Q, X,ô) is one-cluster, if it has a letter 
a € X such that for every pair q,s € Q there are i,j > 1 such that qa’ = sa’. 
This means that the graph of the transformation induced by a is connected. In 
particular, it has a unique cycle C C Q with the property Ca’ = C for every 
i > 0, and there is £ > 0 such that Qa‘ = C. The least such £ is called the level 
of A. Steinberg [21] proved that if the length m of the cycle is prime, then the 
one-cluster automaton A has a reset word of length at most 


n—m+1+ 204+ (m—2)(n+ £8). (1) 


We generalize this result to arbitrary lengths and get an additional negative 
summand. We refine the proof of Steinberg [21] and the summand is expressed 
in algebraic terms of the proof. Therefore, to present the result we have to recall 
basic notations from [21]. 
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Given a one cluster automaton with the notation as above, we consider the 
matrix representation 7: X* — M,,(Q) defined by m(w)g,r = 1 if qw = r, and 
0, otherwise. Given S C Q we define [S] to be the characteristic row vector of 
S in Q”, [S]* its transpose, and ys = [S]7 — (|S|/|C|)[Q]7. By wys we denote 
the product of corresponding matrices; in particular, the word w represents the 
matrix m(w), and the product is a vector in the space Q”. We consider the 
subspace Ws = Span{a’tJys € Q” | 0 < j < m—1} (cf. [21]), and the cyclic 
period qs of S, understood as the least number q such that Sa? = S. Now, we 
define D*(m,k) to be the minimal value of m — qs + dim Ws taken over all 
vectors S with |S| = k. Then we prove the following: 


Theorem 2. Let A = (Q, 27,6) be a synchronizing automaton with n states, 
such that there exists a word w of length s inducing a one-cluster transformation 
with level £ and cycle C of length m > 1. Then A has a reset word of length at 


most 
m-1 


s(€+m —2)(m—1) + (n+1)(m—1) +s- $` D*(m,k). 


k=1 


One can demonstrate that this results generalizes and improves earlier bounds 
in [8, 21], and a careful estimation of the summand D*(m, k) yields the currently 
best general bound for reset lengths of one-cluster automata: 


Corollary 3. A synchronizing one-cluster automaton A with n states and the 
cycle of length m has a reset word of length at most 


+3 


2nm — 4mln +2m—n+1 (2) 
Nevertheless, for small values of m we can compute the exact values of D* (m, k), 
and this yields considerably better bounds than the general estimation above. 


3 Experiments and Conjectures 


In this section we discuss the results of our experiments with the improved 
algorithm concerning various conjectures and open problems in the area. 


The Černý conjecture. We have verified the Cerny conjecture for several 
cases. In particular, we confirmed it for all binary automata with n < 12 states, 
and for all ternary automata with n < 8. 

Verifying the Černý conjecture for binary automata with n = 12 states was 
the most difficult computation that we have performed here. The total time 
of a single processor core spent for this computation was about 100 years. We 
performed this on a grid in parallel using mostly about 200 cores of Quad-Core 
AMD Opteron(tm) Processor 8350, 2.0 GHz. The total number of automata 
generated by our algorithm in this case was about 1015. 


Experiments with Synchronizing Automata 5 


For ternary automata with n = 8 states the computation took 1.25 years of 
a single processor core, and we had to generate and check about 2.1 x 101° au- 
tomata. One may compare these numbers with the numbers of non-isomorphic 
initially connected automata that one would need to generate applying the tech- 
nique described in [3]. The corresponding numbers are: about 2.2 x 101” for 
binary automata with n = 12 states, and 5.7 x 10!” for ternary automata with 
n = 8 states. 

Within the range we have considered, the only automata meeting the bound 
(n — 1)? other than the Cerny series are known examples with n < 6 states that 
were presented in [22]. 


Slowly synchronizing automata. For the case of binary automata n = 12 
states, we have obtained also the complete list of strongly connected synchro- 
nizing automata with reset length > 94. 


Table 1. The numbers of all non-isomorphic strongly connected synchronizing binary 
automata with 12 states with reset length > 94. 


eis TT l ea ereed e 


Table 1 shows the exact numbers of automata in this range, and the corre- 
sponding series according to naming from [3, 4, 16]. Here, all automata with reset 
length > 99 has a similar structure of one long cycle and a small gadget (cf. [4]), 
and they can be generalized to series of length n? — O(n) as well. We confirm, 
for n < 12, [4, Conjecture 1], which is a generalization of the Cerny conjecture, 
describing all binary synchronizing automata with reset length > n? — 4n + 8 
(104 for n = 12) and stating that up to isomorphism this list is complete. 

As observed in [4, 16,22], there are gaps in the set of possible reset lengths 
near the Cerny bound (n — 1)?. We confirm for binary automata that for n = 
6,7,8 there is one gap, for n = 9,10 there are two gaps, and for n = 11,12 there 
are three gaps. 

There is no binary strongly connected automaton with 12 states and reset 
length 95, but we have constructed such an automaton over a ternary alphabet 
(Fig. 1). Similarly, we know an automaton for n = 9 with reset length 53 (second 
gap), and for n = 11 with reset length 79 (third gap). This shows that the gaps, 
except the first one, are not necessarily preserved over larger alphabets. 


Extending words in one-cluster automata. One-cluster automata are an 
important class of synchronizing automata for which a quadratic bound on reset 
length has been found [5,21]. 

Despite several attempts [5, 8, 10, 20, 21] at improving the bounds, so far, the 
Cerny conjecture has been proved only for one-cluster automata with a cycle of 
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Fig. 1. An irreducibly synchronizing strongly connected ternary automaton with 12 
states and reset length 95. 


length n (circular automata) or with a prime-length cycle. In [21] an algebraic 
argument making use of ascending chain of linear subspaces and averaging trick 
has been applied. The proof is based on the claim that any subset S C C on the 
cycle C can be extended on this cycle by a word of length at most 4 + n (we 
apply here the notation of Section 2). It is demonstrated that this holds in the 
case of prime length of C. Proving it for non-prime lengths would provide the 
proof of the Černý conjecture for the whole class of one-cluster automata. 

We have exhaustively searched for small examples of one-cluster automata 
with a non-prime cycle length such that the length £+ n of extending words is 
exceeded for some subset S, but found out that + n is sufficient in all tested 
cases, instead of the value n + £ + |C| — D*(|C|,|S|) used to prove the bound 
from Theorem 2. Also, we found out that we can always use an extending word 
of the form waf with |w| < n, which is the form used in the proof for prime |C]. 


Conjecture 1. Let A be a one-cluster synchronizing automaton with a one- 
cluster letter a with the cycle C and level £. For any non-empty proper subset 
S CC there is a word w such that |S(wa’)~!MC| > |S] and |w| < n. 


In all the cases tested, for any £, non-prime |C| < n, and |S| with 1 < |S| < 
|C|, we found an automaton for which we needed a word w of length exactly n. 
So, it seems that the bound |w| < n is tight. 


Worst cases for the greedy compressing algorithm. The greedy compress- 
ing algorithm is a well known approach for finding a reset word [11, 19, 24]. It 
starts from S = Q, and iteratively finds a shortest word w such that |Sw| < |S] 
and uses Sw for next iteration, until |S| = 1. The concatenated words w form the 
found reset word. The length of the resulted reset word can vary, since there is 
ambiguity in selection of shortest words w. By bounding the length of the found 
reset word we also obtain an upper bound for the reset length, and in fact, the 
upper bound (n? —n)/6 for the reset length is obtained by bounding the lengths 
of words w for |S| = 2,...,n and summing these bounds [19]. It is known that 


Experiments with Synchronizing Automata 7 


this algorithm finds a word of length R(n? logn) for the Cerny automaton [15], 
but it was not clear whether it is the worst case example. 

We experimentally tested the greedy algorithm for the worst cases. Here, we 
restricted the studied class to irreducibly synchronizing automata, as otherwise 
we would get a lot of trivial examples derived from automata over a smaller 
alphabet. By the worst case length we mean the maximum length of the found 
word by the algorithm over all selections of shortest compressing words that can 
be taken by the algorithm. For example, for automaton G; from Fig. 2, the worst 
case length is 19 and a sequence of subsets considered by the greedy algorithm 
in the worst case can be the following: 


Q = {1,2,3,4,5} + {1,2,4,5} “4 {3, 4,5} ZSS 41, 4} mee? £3). 


While this requires potentially very expensive computation, the worst case length 
can be computed by a kind of dynamic algorithm and n—1 iterations of breadth- 
first search in the power automaton. 

It may be surprising that the Cerny automata generally do not exhibit the 
worst case length. We have observed that for some values n > 10 the slowly 
synchronizing series W,, 2%, and GY, (see [3,4]) exceed the worst case length 
of the Cerny automaton with the same number of states. In addition, we have 
found out four particular ternary examples shown in Fig. 2 exceeding the worst 
case length of the Černý automaton with the same number of states, which do 
not seem generalizable to series. Up to isomorphism, there are no more such 
examples within the range we have considered (Table 2). 

The results we have collected do not allow to state a reasonable conjecture. 
So far, Ýn is the best candidate for the largest worst case lengths for n > 10, 
and the Černý automata for n < 9, except Gı and Gz from Fig. 2 for n = 5,6. 


Problem 2. What are the largest worst case lengths of the greedy compressing 
algorithm of automata with n states? 


It it noticeable that the dual greedy extending algorithm, which starts from a 
singleton and uses shortest extending words rather than compressing ones, seem 
to have generally larger worst case lengths. For example, for the case of binary 
n = 7 in the worst case it can find a reset word of length 48 for some strongly 
connected automaton, whereas the greedy compressing algorithm finds a word 
of length at most 43. 


Aperiodic synchronizing automata. Recall that an automaton is aperiodic 
if there is no word inducing a transformation with a cycle of length > 2 (the 
transition semigroup has only trivial subgroups). In [24] Volkov mentioned that 
although a quadratic upper bound for the reset length of aperiodic synchronizing 
automata has been proved, the largest reset length for known aperiodic automata 
does not exceed n + |n/2| — 2. This length is reached by a series of binary 
automata constructed by Ananichev [2]. In this connection, it may be interesting 
to know that the same bound is also reached for every n > 1 by a series of 
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Fig. 2. Automata Gi, Gz, G3, and G4, with the worst case length 19, 30, 28, and 28, 
and reset lengths 15, 22, 20, and 20, respectively. 


irreducibly ternary aperiodic automata. It has a quite simple definition and an 
easy proof for the reset length (comparing with [2]). Let A, = (Q, {a,b,c}, 6), 
where Q = {v1,..., Un}, Olvi a) = ui41 for 1 < i < n — 2, (vi, b) = v;-1 for 
2<i<n-1, d(u[ns2},c) = Un, and (vi, x) = vi, otherwise (x € X) (shown 
in Fig. 3). 


Fig. 3. A ternary irreducibly synchronizing n-state aperiodic automaton with the reset 
length n + |n/2| — 2. 


Volkov! has also pointed out that n— 1 may be an upper bound for the reset 
length in the class of strongly connected synchronizing aperiodic automata, but 
there was not enough evidence. The bound can be met trivially if the underlying 
digraph of the automaton is a bidirectional path: Q =1,...,n, for every 1 <i < 


1 personal communication 
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n — 1 there are the directed edges (7,2 + 1) and (i + 1,7), and every edge that is 
not a loop is of that form. 

Since our verifications involve a huge number of aperiodic automata, we ex- 
perimentally support the following conjectures: 


Conjecture 3 (cf. [24]). Every synchronizing aperiodic automaton with n > 1 
states has a reset word of length at most < n+ [n/2] — 2. 


Conjecture 4 (Volkov). Every strongly connected synchronizing automaton 
has a reset word of length at most n — 1. Moreover, if this bound is met, then 
the underlying digraph of the automaton is a bidirectional path. 


Avoiding states. In a recent short note [13] the authors state the following 
problem related to the recent unsuccessful attempt of improving the general 
upper bound on reset length [23]: Given a strongly connected synchronizing 
automaton, what is the minimal length £ such that for any q € Q there is a word 
w of length < £ and such that q ¢ Qu. If £ € O(n), then we would obtain a 
better upper bound than (n? — n)/6. 

Experimentally, we have found out what the value of £ for a given n might 
be, and provided support for the following conjecture: 


Conjecture 5. In a synchronizing strongly connected automaton, for any q E€ Q 
there is a word w of length < 2n — 2 and such that q ¢ Qw. This bound is tight 
for n > 4 over a ternary alphabet. 


Recently, Vojtěch Vorel? discovered an infinite series of binary automata whose 
minimal length in question is 2n — 4, which is currently the best theoretical lower 
bound for the problem. 


New rank conjecture. Pin [19] proposed the following generalization of the 
Cerny conjecture: For every 0 < d,n, if there is a word of rank < n — d, then 
there is such a word of length < d?. Pin proved this for d < 3. However, Kari [14] 
found a celebrated counterexample to this conjecture for d = 4, which is a binary 
automaton K with 6 states (Fig. 4). As a consequence, a modification of this 
generalized conjecture was proposed restricting it to d being the rank of the 
considered automaton (see for example [1]). However this seems to be a quite 
radical restriction. 

In our computations, we have found no other counterexample to Pin’s con- 
jecture except for trivial extensions and modifications. This may suggest that 
Kari construction works due to the number of involved states small enough, and 
is, in fact, an exception. By a trivial extension of an automaton over alphabet 
X we mean one obtained by adding letters to X that acts either as the identity 
transformation or as any letter in X. So a trivial extension has the same number 
of the states and the transition semigroup, and trivial extensions of the Kari 
automaton K are counterexamples to the Pin’s conjecture, for d = 4, as well. By 


2 personal communication, unpublished 
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Fig. 4. The Kari automaton K [14], and a Kari-like automaton K’. 


a disjoint union of two automata A = (Q, 2,6) and A’ = (Q’, 5’, 6’) we mean 
the construction where the automata have the same alphabet X, and disjoint 
sets of states Q, Q’, and the union is simply A = (QUQ’, X,U ð’). If we take a 
disjoint union of K with any permutation automaton (one whose letters act like 
permutations, or in other words, one of rank equal to its size), then again we get 
a counterexample to the Pin’s conjecture, for d = 4. Yet, in all these automata 
the failure is caused by the same Kari construction on the set of the 6 states. 
In our experiments, we have discovered no other counterexample. This may be 
treated as an evidence for the conjecture we state below. 

Consider the smallest class of automata containing K and closed on tak- 
ing trivial extension and disjoint union with permutation automata. Let us call 
automata in this class Kari-like automata (see Fig. 4). Then we have 


Conjecture 6. For every d, if an automaton A has a word of rank at most 
n — d, then there is such a word of length at most d?, unless A is a Kari-like 
automaton and d = 4 (in which case there is a word of rank n — 4 of length 
d?+1=17). 


Subset synchronization. The last conjecture was posed by Angela Cardoso: 


Conjecture 7 (Cardoso [7]). In a synchronizing automaton, for any subset S' 
of states there is a word w with |Sw| = 1 of length at most 


oof] ese) 


This is another generalization of the Cerny conjecture, and it can be viewed 
as a counterpart for the rank conjecture, where we bound the length of words 
compressing Q to a subset of the given size, rather than a subset to a singleton. 

Conjecture 7 has been proved for several special classes of automata, and the 
formula is tight for any subset size in the Černý series. Besides confirmation for 
small automata, we identified 18 particular examples of irreducibly synchronizing 
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automata with n € {3,4,5,6} states meeting the bound for some subset S$ that 
are not isomorphic to the Cerny automata. Note that the conjecture is not true 
in general for non-synchronizing automata, as Vorel [25] has constructed a series 
of non-synchronizing strongly connected binary automata with subsets whose 
shortest synchronizing words are of exponential length. 


3.1 Summary 


Table 2 summarizes the ranges for which the discussed conjectures have been 
confirmed or the problems checked. The ranges vary due to different numbers of 
automata that have to be checked, computational complexity of verification for 
a single automaton, and computation time devoted for each of the problems. 


Table 2. Experimental verification of conjectures. The numbers denote the size of the 
alphabet up to which the given conjecture has been checked. The symbol oo denotes 
that the problem has been verified for all automata with the given number of states 
and any number of letters. 


Problem 
4] 5] 6] 7] 8] 9 [iour] 


(Gong conjecture ands Cone [= [>] [4] 5 [2 [2 [2 [2 
Conjecture Tomes p= fsfafs[2t2t2t |_| 
Problem 2 (geady eoim Jefo a e ee |_| 
Coas wii O e e e e ee |_| 


Conjecture (strongly conmeaed aperodið| | 8 |5 |3 12 12 1212] 
Conjecture 5 avoiding sare) (ols Jala eee] | 
Conjoture 6 (new rank conje) [oof 5131322] | 
Conjecture 7 (subset synchronization) ||| 5 [a32 [2| | 
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